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Abstract 



Progressive values of two probabilities are obtained for parameter estimates derived from 
an existing set of values and from the same set enlarged by one or more new values, respectively. 
One probability is that of erroneously preferring the second of these estimates for the existing 
data ("type I error"), while the second probability is that of erroneously accepting their estimates 
for the enlarged test ("type II error"). A more stable combined "no change” probability which 
always falls between 0.5 and 0 is derived from the (logarithmic) width of the uncertainty region 
of an equivalent "inverted" sequential probability ratio test (SHIT, Wald 1945) in which the error 
probabilities are calculated rather than prescribed. 

A parameter change is indicated when the compound probability undergoes a progressive 
decrease. The tests is explicitly formulated and exemplified for Gaussian samples. 
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Inverse sequential detection of parameter changes in developing time series 
by Uwe Radok and Timothy J. Brown 


1. Introduction 


The series will be assumed to consist of values from a population with probability function 

/(x)=p(x;0;0';0"; •••)<& ( 1 ) 

The existing, or most recent, m values of the series provide parameter estimates 0 m ; 0" m ; 

which change to 0 m+7 ; 0V*-/ J 0V+/! * • * after a further 1,2, • • • J values have been added 
to the series. These parameter estimates define four different likelihoods: 


Lm(rn) = Y[p m 


l 


Vi+/ ) = IT Pm+j 

I 


( 2 ). 


m+j 

L m+J (rn+j)=Ylp m+j 
1 


m+j 

Lm(m+j)=Jlp m 
i 


Throughout bracketed symbols indicate the numbers of values used to calculate the likelihoods, 
and subscripts the numbers of values used for the parameter estimates. 


The four equations (2) have been combined for a formal test of the hypotheses H(m): 
0 = 0„,0' = 0' m ,0"=0" m • • • , and tf(m+y): 0 = 0 m+J , O' = e M+j , 0" = B" mj • -.intennsof 
the Neyman-Pearson theory (see e.g., Hoel 1962, chapter 9). this showed (Radok and Brown, 
submitted) that the likelihood ratios q(m)=L m+J (rn)/L m (m) and 
q (m +j ) = L m+ j (m +j )!L m (m +j ) can be estimated by the well-known decision limits of Wald’s 
(1945) "sequential probability ratio test (SPRT)": 


q(m) = 


P 

1 -a 


q(m+j)= 

a 


(3) 


where a is the type I error probability of rejecting the estimates 0 m , 0' m , 0" m • • • for the exist- 
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ing m values, and P is the type II error probability of accepting the same estimates for the aug- 
mented set of m+j values; (3 also represents the type I error of not accepting the estimates 0 m+7 , 
0' m+7 , 0" m+7 ; for that augmented set of values. Our test in effect places the likelihood ratio q(m) 
on Wald’s lower deicision limit, and the ratio q(m+j) on upper limits which change as further 
values become available. But in contrast to the SPRT the "inverse" procedure involves known 
likelihood ratios and unknown probabilities which can be found from (3) as 


a = (l - q(m))/(q(m +j)-q(.my) 

$ = (q(m+j)q(m)-q (m ))/(<? (m +j)-q (m )) 


(4) 


Changes in one or several of the parameters are indicated when these error probabilities 
show decreasing trends. New parameter values can then be estimated from the new values alone. 
Since the individual probabilities can be somewhat irregular due to rounding-off errors, trends 
become more clearly visible in their average or in a compound "no-change" probability y defined 
by the logarithmic width of the SPRT’s indecision region, 


log, [(1 - P)/a] - log, [p/(l - a)] = log, [q (m+j)lq (m )] = log. Q ; 

writing this as 


log,Q = log. 



(5) 


leads to 


T-o+^rr 1 (6) 

y falls between 0 and 0.5 as long as q(m+j) > ( q(jn )) and between the arithmetic and 


geometric averages of a and p (see appendix A). 
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2. Inverse sequential formulae for Gaussian means and variances. 

The basic probability for the Gaussian distribution, 

P = (2jco 2 r ,/2 exp[- C x - it ?I2<S^ (7) 

involves two parameters (p, a 2 ) which can only be tested jointly since in the present context nei- 
ther is known a priori. We use the sample mean as estimate for the population mean p, and 
o 2 = [(n/(n - 1)] s 2 where s 2 is the sample variance and n (= m or m+j) is the number of values 
used. The products (2) are converted to sums by taking logarithms; thus 

log. £*,(/«)= -^ log, 2* - y log.o 2 - 1 (x - p^/ 20 2 ; (8) 

m 

with, £(x - p) 2 = (m - l)o£ the last term reduces to - (m - 1)/2. 

l 

Proceeding in the same way for (m ) leads to 

log Z*, +> (m ) - y log, 2k - y log, a 2 +j - £(* - p m+J ) 2 /2a 2 +j ; (9) 

with p m+; - - p m = Ap the numerator of the last term can be written 

-£(* + Ap)) 2 = -£(j -p m ) 2 -2(-2xAp + 2p wl Ap + Ap 2 ) 

t l i 

= ~(m - 1)0 2 -2rnp m Ap + 2^ip m Ap-rn(Ap) 2 


so that (9) becomes 


m 


, , , (m-l)o2 m (All) 3 

l°fci«,/=-ylOg2*-y log. O 3 ,; - 


' m+j 


(9’) 


Finally subtracting (8) from (9’) gives the log likelihood ratio 


i / \ i m , m - 1 

log, q(m) = log, ; - = — log, —5 — + -t— 


L m (m) 


} m+j 


,2 ) 


1 - 




/w(Ap) 2 

2o 2 +; - 


( 10 ) 
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Proceeding the same way for the augmented set of m +j values yields first 

log, m+j{m+j) = -^- log, 2 k i 0& o* + y - £(x -n m+> ) 2 /2o 2 + y (11) 


where the last term, with £(x - |x m+ y) 2 = ( m+j - 1) reduces to {m+j - l)/2. Next 


log,L m (m+/) = - log,2jt - log,o 2 - £(x - (M*+y - Ap)) 2 /2a 2 +; . (12) 


Expanding the last term as before yields 


i it L i\ m+ J , ^ m+/ , 2 («+/ ~ l)oiS+y (m +j )A| i 2 

log,Z, m (m+/) » - — ^ log, 2 k - log, o 2 = - - — V^ ~ (12’) 


2o 2 


Subtracting (12’) from (1 1) we obtain the second log likelihood radio 


i , ,,, m+j , °m m j — 1 

log , q (m+j)- —L- log, — r— + — *z 

2 q m+; 2 


J m+j 


Om 


-1 


+ ( m +/XAp.) 2 ( 13 ) 

2o 2 


The final formulae therefore are 


<7 (m ) = exp 


mlog. 


°m m-\ 
2 


1 - 


J m+; J 


771 


2o, 


'm+/ 


(Mm+y-Mm ) 3 


?(m+/) = exp 


(m+/)log. 


q m m+j — 1 

q m+l 2 


J m+j 




-1 


(14) 


+ T^(H* +V -Mm) 2 

2o 2 


The first two exponents in each formula reflect solely changes in variance, while the third 
exponents depend primarily on changes in die mean. 


3. Example. 

Fig. la shows a series of values drawn at random from Gaussian populations with different 
means and variances. The different probabilities are shown in fig. lb and suggest a change start- 
ing with the 7th value and accelerating after the 9th value, the actual beginning of a new section 
in "statistical control" (Shewhart 1939). A spurious change is suggested briefly after the 14th 
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value, but this is reversed until a real change becomes visible after the 26th value, just inside the 
next controlled section. It is noteworthy that these control changes are revealed even though the 
population parameters differed considerably from their (small) sample estimates employed by the 
test, as the only information available in practice. 

4. Conclusion. 

We plan to carry out more extensive experiments to establish the lull properties of the 
inverse sequential procedure, and to test its efficacy on different geophysical time series. Addi- 
tional parameters for which the procedure has been formulated, but not yet adequately tested, 
include Poisson means (variances), chi-square means (representing also variances and degrees of 
freedom) and linear and exponential trends. These will be reported in a further paper. 
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gram. 
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Captions: 

Fig. 1 : An inverse sequential detection of parameter changes. 

a) Values drawn at random from three Gaussian populations with means and vari- 
ances indicated. 

b) Probabilities that no parameter changes are occurring. 


For symbols see text 
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Appendix A: The no-change probability y 
The probability y as defined by equation (6) is, 

Y=0 + V0T 1 , 

and lies between the arithmetic and geometric means of the probabilities a and p. This can be 
shown by alternatively substituting these means for a and P in the expanded equation (Al), 


(Al) 


_ _ [ 1 - (ct+P) + aP] 
u ap 


(A2) 


When a = P = (a + P)/2, equation (A3) becomes 


Q i = 


r, ^ . c^+p^ 


2 





(A3) 


This shows that the numerator N and the denominator D of Q both have been increased by 
e = (a 2 + P 2 )/2 > 0. Since Q > 1 (i.e., N > D ), then Q =N/D >Qi = (N +e)/(D +e) since 
ND +Ne> ND + De,otN > D, the initial condition. 

Again with a = P = (aP) 44 , equation (A3) becomes 


Q 2 = 


1 -2(ap)' /i + ap 


ap 


(A4) 


so that Q 2 ~Q = - 2(aP) ,/4 + (a + P)>0 since a + P>2(aP) 44 ; this can be seen by squaring 
both sides giving 


(a+P) 2 + a? + p 2 + 2ap>4ap,or (a-*P) 2 >0 
Finally, with Q 2 >Q >Q 1 , 


(A5) 


^1 + \]Q2 I ~ Y ftomttric < Y — ^ * < "larithmttic ~ ^ lj • (A6) 








